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ABSTRACT 

We a dapt the smooth tests of goodness of fit developed by iRavner fc Best! (1989, 
1990) to the study of the non-Gaussianity of interferometric observations of the cos- 
mic microwave background (CMB). The interferometric measurements (visibilities) are 
transformed into signal-to-noise eigenmodes, and then the method is applied directly 
in Fourier space. This transformation allows us to perform the analysis in different 
subsets of eigenmodes according to their signal-to-noise level. The method can also 
deal with non-uniform or incomplete coverage of the UV plane. We explore here two 
possibilities: we analyse either the real and imaginary parts of the complex visibilities 
(Gaussianly distributed under the Gaussianity hypothesis) or their phases (uniformly 
distributed under the Gaussianity hypothesis). The power of the method in discrim- 
inating between Gaussian and non-Gaussian distributions is studied by using several 
kinds of non-Gaussian simulations. On the one hand, we introduce a certain degree of 
non-Gaussianity directly into the Fourier space using the Edgeworth expansion, and 
afterwards the desired correlation is introduced. On the other hand, we consider in- 
terferometric observations of a map with topological defects (cosmic strings) . To these 
previous non-Gaussian simulations we add different noise levels and quantify the re- 
quired signal-to-noise ratio necessary to achieve a detection of these non-Gaussian 
features. Finally, we have also studied the ability of the method to constrain the so- 
called nonlinear coupling constant Jnl using \ 2 simulations. The whole method is 
illustrated here by application to simulated data from the Very Small Array interfer- 
ometer. 

Key words: methods: data analysis - methods: statistical - cosmic microwave radi- 
ation 



1 INTRODUCTION 

The study of the probability distribution of the primordial 
density fluctuations in the Universe is one of the most fun- 
damental challenges in the present day cosmology. The de- 
termination of this probability distribution could constrain 
the ensemble of cosmological theories of the formation of 
primordial density fluctuations. In particular, standard in- 
flationary theories establish that these d ensity fluctuations 
are Gaussianly distributed jGuthlll98ll) . So the detection 
of some non-Gaussian features would question these kinds 
of theories. A powerful tool in the determination of cos- 
mological parameters is the cosmic microwave background 
(CMB). The primordial density fluctuations left their im- 
print in the CMB because of the thermal equilibrium exist- 

* e-mail: aliaga@ifca.unican.es 



ing before matter-radiation decoupling. In this manner, the 
initial density fluctuations led to anisotropies in the CMB, 
and these inherited the probability distribution of the for- 
mer. In short, testing CMB Gaussianity is equivalent to test- 
ing the Gaussianity of the primordial fluctuations and hence 
the validity of standard inflation. 

For these reasons there is great interest in the imple- 
mentation of statistical methods to analyse the Gaussianity 
of current and future CMB data. A representative sample 
of these methods (containing real-space statistics, wavelets 
and Fourier-based analyses) have been recently applied to 
prob e the Gaussianity of the data from the WMAP mis- 
sion l|l ^matsuj3t_alj200^ alJ 
l2004allbUVielva et alJl2004 |Parkll2004t ICruz et al]l2004T) . 

In this paper, we shall co ncentrate on the smooth test s 
of goodness of fit developed bv lRavner fc BestJ (119891119901) . 
These methods have been already adapted to CMB data 
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analysis, and applied JCavon et alJ l2003bl) to the study 
of the Gaussianity of MAXIMA data jBalbi et al.l Eooot 
lHananv et"all f200(]1. We further adapt this method here to 
deal with data from interferometers. 

With their intrinsic stability (only the correlated sig- 
nal is det ected) and their abili t y to reject atmospheric sig- 
nals (e.g. iMelhuish et alJlTool lLav fc Halversonll2000h in- 
terferometers have proved their worth in ground-based ob- 
servations of the CMB. Over the last few years several in- 
terferometers have provided high-sensitivity measurements 
of the power spectrum at intermediate angular scales. In 
particular, these experiments include t he Very Small Ar - 
ray CVS A IWatson et al]l20 03h. DASI ijLeitch et al]l2002l) 
and CBI (Padi n et al.M200lfl : for a more detailed compila- 
tion of interf e romet ric experiments measuring the CMB, e.g. 
IWhite et al.l il99 9). These instruments sample the Fourier 
modes of the measured intensity, so they provide a direct 
measurement of the power spectrum on the sky and allow 
us to study the Gaussianity directly in harmonic space. 

Non- Gaussianity analy ses have been already performed 
on in terferometer datasets l|Savage et ahlEooS Ismith et alJ 
l2004f) . In the first case these analyses are mainly based 
on real-space statistics (they are applied to the maximum- 
entropy reconstructed maps in real space); while in the sec- 
ond case, the analysis is performed in Fourier space by com- 
puting the bispectrum in a similar way to the computation 
of the power spectrum. We follow the second approach and 
present here the smooth tests of goodness-of-fit adapted to 
the study of the Gaussianity of interferometric data directly 
in the visibility space, so we test directly the Gaussian na- 
ture of the Fourier modes on the sky. 

Since an interferometer measures complex quantities 
(visibilities) , we present two different analyses in this paper, 
one on the real and imaginary parts, and another on the 
phases. The experimental data are a combination of the sig- 
nal plus noise. In principle, it would be desirable to analyse 
data in which signal dominates over the noise; that is, data 
with a high signal-to-noise ratio. One suitable approach to 
select only data with high signal-to-noise ratio is the trans- 
formation of the data into signal-to-noise eigenmodes teondl 
Il995l) . This formalism allows us to eliminate the data dom- 
inated by noise. Moreover, as we explain below, the signal- 
to-noise eigenmodes are not correlated, allowing the direct 
implementation of the method. 

As it is shown below, our method looks for devia- 
tions with respect to the null hypothesis (Gaussianity) in 
statistics related to the measured moments (mean, variance, 
skewness, kurtosis, ...) of the visibility signal-to-noise eigen- 
modes. Thus, it is clear that the test is specially tailored for 
detecting non-Gaussian signals directly in the Fourier do- 
main. This Gaussianity analysis of the Fourier components 
completes the analysis of Gaussianity in the real space, given 
that some non-Gaussian detection in the Fourier space could 
indicate some degree of non- Gaussianity in the real space 
and vice versa (a linear transformation of Gaussian vari- 
ables is also Gaussian). In fact, for example, the analysis of 
Gaussianity directly in the Fourier space by means of the 
bispectrum has demonstrated to be a powerful tool in the 
detection of the /nl parameter jKomatsu fc Sperge]||200ll 
iKomatsu et al.l I2003T) . Then, the analysis in Fourier space 
could detect non- Gaussianity that, in principle, should be 
more evident in real space. As an illustration of this fact, we 



will show how our method detects the non-Gaussianity of a 
cosmic strings map directly in Fourier space, even though it 
seems more natural to search the non- Gaussian features of 
these objects in real space. 

The organization of the paper is as follows. Smooth tests 
of goodness-of-fit are introduced in Section |2] W e focus our 
intere st in the statistics developed bv lRavner fc Best! (Il989l . 
Il990h and their application to the Gaussian and uniform 
distributions that we are going to test. In Section |3] signal- 
to-noise eigenmodes are reviewed and applied to the case 
of interferometer observations. The simulations we are us- 
ing are described in Section 0] where the Very Small Ar- 
ray (VSA) is taken as a reference experiment. The appli- 
cation to Gaussian simulations is described in Section [S] 
The power of the test in discriminating between Gaussian 
and non-Gaussian signals affected by Gaussian noise is stud- 
ied in Section |S| We consider three kinds of non-Gaussian 
simulations: those performed using the Edgewort h expan- 
sion introduced bv lMartmez-Gonzalez et alJ i2002l) . a string 
simulation created bv TBoTichBt"^tHuT ' i^8^ and \ 2 simula- 
tions with a /jvl coupling parameter. Finally, Section Q is 
dedicated to discussion and conclusions. 



2 SMOOTH TESTS OF GOODNESS-OF-FIT 

In this section we present the smooth tests of g oodness-of- 
fit and the work of iRavner fc Bestl Jl989l Il99di5 applied to 
a Gaussian and a uniform variable. 

Let us suppose n independent realizations, {xi}, of a 
statistical variable x (i — l,...,n). Our aim is to test 
whether the probability density function of x is equal to a 
prefixed function f(x). The smooth tests of goodness-of-fit 
are constructed to discriminate between the predetermined 
function f(x) and another that deviates smoothly from the 
former and is called the alternative density function. We con- 
sider an alternative density function given by f(x, 0), where 
is a parameter vector whose ith component is 9i and there 
exists a value do such that f(x,Go) — f(x). The alternative 
function deviates smoothly from f(x) when is displaced 
from 80 . 

The probability density function of the n independent 
realizations of x is given by \Yi=i f( Xi > Given these real- 
izations, we calculate the estimated value of 6 by means of 
the maximum likelihood method and denote this value by 
6. The quantity W is defined such that 




nr=i/(^ e °)' 



where W is a measurement of the difference between 6 and 
60 and is therefore a test of the hypothesis that 6 = 6q. 
Assuming 6 to be close to Gq and n to be very large (n — > 
00), the quantity W is equal to the so-cal led score statistic 
JCox fc Hinklevlll974l:lAliaga et al.ll2003bl) . 

S = U*(0 )r 1 (p a )U(0 ), (1) 

where U(9) is the vector whose components are Ui(G) = 
d£({xj},G)/d8i, with the log-likelihood function defined as 
£({xj},0) = log Y[ H = i f( x h U % is the transposed vec- 
tor of U and the components of the matrix I are equal to 
ly(0) = (Ui{0)Uj(fi)). 
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Among all the possible choices of alternative den- 
sity functions, we select t hat presented in the work of 
iRavner fc Bestl l|l98flt ll£ 



2.1 Rayner-Best Test 

In the work of IRavner fc Besd <198' 
density function is defined as 



between /I3 and S3. Then, the data have a large skewness 
value. This holds for all distributions and not only for the 
Gaussian case. 

These e x pressio ns (or the equivalent Sk) are used in 
ICavon et ail (I2003bh to test the Gaussianity of the MAX- 
IMA data. Th e skewness and the k urtosis of these data are 
an alternative constrained in lAliaea et alJ (l2003af) using the same method. 



g k (x) = C(e 1 ,...,6 k )expl J^WO \f( x ) 



where C(8i, . . . , Ok) is a normalization constant and the hi 
functions are orthonormal on / with ho(x) — 1 (note that 
do — 0). It can be demonstrated that the score statistic 
associated with the k alternative is given by 



with 



Ui = 



Em 

j=i 



(2) 



Note that the previous Ui quantity is not the ith com- 
ponent of the vector U in expression Q (although they are 
closely related) . We denote the two quantities with the sam e 
letter to follow the notation of IRavner fc Besd (Il989l.ll99cl) . 
As equation Q shows, the expression @ gives the score 
statistic only when the distribution of the data is f(x) (be- 
cause the condition = 80 is imposed). In this case (that 
is, when the distribution of the data is equal to the prefixed 
one), the Sk statistic is distributed as \\ when n — > 00. This 
holds because Ui is Gaussianly distributed when n — *• 00 
(the central limit theorem). Moreover, it is easy to prove 
that the mean value of Uf is equal to unity, independently 
of the value of n. We see that we can work with the Sk 
quantities or alternately with Uf (if the distribution of the 
data is equal to f(x), these are distributed following a Xi 
function) . 

In this paper we apply these statistics to two cases: on 
one hand, to the real and imaginary parts of the visibilities 
and on the other hand to the phases of the visibilities. In 
the first case, the function f(x) is a Gaussian distribution, 
and in the second case the distribution is a uniform. 



2.2 Gaussian Variable 

If f(x) is a Gaussian distribution with zero mean and 
unit variance, the hi functions are the normalized Hermite- 
Chebyshev polynomials hi(x) — Pi(x)/si with Si — Vil, 
Po(x) = 1, Pi(x) — x, and, for i > 1, Pi+i(x) = xPi(x) — 
iPi-i(x). The Uf statistics are given by: 



ut 

ui 
ui 



n(fi2 - 
n[(fi4 



1)72 
3/ii 
-3)- 



> 2 /6 
- 6(/t 2 



l)] 2 /24 



where jx a — Q2j=i x< j)/ n IS t ne estimated moment of or- 
der a. Bearing in mind the relation between the Uf and the 
Sk quantities, we see that the statistic Sk is related to mo- 
ments of order < k, so that this test is directional; that is, 
it indicates how the actual distribution deviates from Gaus- 
sianity. For example, if Si and S2 are small and S3 is large. 
The data then have a large (13 value because of the relation 



2.3 Uniform Variable 

The smooth test of goodness-of-fit applied to the uniform 
distrib ution was developed by the first time by iNevmar] 
and his work is th e starting point of the work of 
IRavner fc Besli (Il989lll990|) . If we want to test whether f(x) 
is a uniform distribution in the interval [0, 1], we define z = 
2x — 1 and then we test if z is a uniform variable in [—1,1]. 
In this case we take the Legendre polynomials: Po(z) = 1, 
Pi (2) = 2 and (i + l)P i+1 (z) = (2i + l)zPi(z) - iPi-i (z) for 
i > 1. Thus the normalized functions are hi(z) — Pi(z)/si 
with Si = (2i + 1) _1//2 . In this case: 



Ul = 

ui = 

ui = 

ui = 



Sn(fn) 
45n(fi2 
7n(5/*3 — 
9n[35(/i4 



l/3) 2 /4 
3Ai) 2 /4 
1/5) - 30(/} 2 - l/3)] 2 /64 



where jX a = QJ - =1 z")/n. As in the Gaussian variable case, 
we see that the test is directional: deviations in the esti- 
mated moments ji a from the expected values for a uniform 
distribution give deviations in the statistics. 



3 SMOOTH GOODNESS-OF-FIT TESTS AND 
SIGNAL-TO-NOISE EIGENMODES: 
APPLICATION TO INTERFEROMETER 
OBSERVATIONS 

If we assume a small field size (where the flat-sky approxima- 
tion of the observed region is valid) , an interferometer mea- 
sures complex visibilities at a frequency v (the van Cittert- 
Zernicke theorem): 



V(u,u) — / P(x, v)B{x, v) exp {i2ixu ■ x)A x, 



(3) 



where x is the angular position of the observed point on the 
sky and u is a baseline vector in units of the wavelength 
of the observed radiation ( monochromatic radiation is as- 
sumed, and hereinafter we omit the dependence on the fre- 
quency, v). P(x) is the primary beam of the antennas (nor- 
malized to P(0) = 1), and the function B(x) is the bright- 
ness distribution on the sky, which can be easily translated 
into temperature contrast values AT(x)/T for the case of 
the CMB. 

If we denote by P the Fourier transform of P and by 
C(u) the CMB power spectrum (in units of brightness, u — 
\u\) then the correlations between the visibilities observed 
at Ui and Uj are given by 



<V(u<)V>j)) 



P(ui — u)P*(uj — u)C(u)d 2 u 
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{V{Ui)V{Uj)) = J P(u l ~u)P{u J +u)C(u)A 2 u (4) 

where P* denotes the complex conjugate of P. 

These q uantities can be computed semi-analytically as 
described in lHobson fc Maisingerl (12002ft . who show how to 
perform a maximum-likelihood analysis to extract the CMB 
power spectrum from interferometer observations. Note that 
for the common case in which the primary beam can be ap- 
proximated by a Gaussian function P(x) = exp(— \x\ 2 /2a 2 ), 
it can be demonstrated that the real and the imaginary 
parts of the observed complex visibilities are uncorrelated 
and thus independent, if the Gaussianity of the CMB holds. 

Our aim is to test if the visibilities are Gaussianly dis- 
tributed. This can be done by analysing the real and imag- 
inary parts of the visibilities or studying their phases. The 
following subsections explain how we work with the data. 

3.1 Signal-to-Noise Eigenmodes 

We work with signal -to-no i se eig enmodes as they are de- 
fined in the work of Bond (1995). Let us suppose an ob- 
served variable Afc, where the subscript k denotes a pixel 
or a position in the space where that variable is defined 
(for example the real or the imaginary parts of the visibility 
V(uk) are measured in the position Uk of the called UV 
space). This variable is the sum of a signal and a noise 
component (Afc = Sk + Uk) whose respective correlation 
matrices have components given by C St kk' = ( s fe s fc'} an d 
Cn,kk' — {nkn k i). The brackets indicate average to many 
realizations. The signal-to-noise eigenmodes are defined by 

& = ^(RL-XA,,, (5) 

p 

where L n is the called square root matrix of the noise cor- 
relation matrix (i.e. Cn = L n L^) and R is the rotation ma- 
trix which diagonalizes the matrix L„ C^L"*. The eigenval- 
ues of this diagonalization are denoted by Ek- In the case 
we are studying, the noise has zero mean and is not corre- 
lated, i.e. {rikny) — o\o~kk' ■, so the components of L n are 
L„,kk' = °~k5kk>- 

Eq. gives us transformed signal and noise s k and 
h k such that: £ fc = s fe + n fe with (s fe s fc /) = E k S k k> and 
{fikhk') = 8 k k> ■ So, the correlation matrix of is given 
by C e = L ( L\ with L iM ' = (E k + l) 1/2 5 kk> . We now have 
a clear characterization of the signal-to-noise relation of our 
(transformed) data. The noise dominates over the signal if 
Ek < 1. So signal-to-noise eigenmodes with a very low 
value of the associated eigenvalue Ek are very much domi- 
nated by the noise and we do not want to include them in our 
analysis. This is the main interest of this approach, but there 
is another interesting point: the signal-to-noise eigenmodes 
are uncorrelated data. This latter fact will be very useful in 
the application of the tests of Gaussianity described in this 
paper as explained in the next subsection. 

3.2 Real and Imaginary Parts 

We write the visibilities in terms of their real and imaginary 
parts: 

V( Ul ) = vf + wl 



Testing the Gaussianity of the visibilities is equivalent 
to testing the joint Gaussianity of their real and imaginary 
parts. As indicated above, when the primary beam is Gaus- 
sian, the real and imaginary parts are independent if the 
Gaussianity of the CMB holds. 

Our data to be analysed are the set of real parts of 
the visibilities which are correlated among them (that is: 
(V^Va ) ^ 0) and imaginary parts which also are correlated 
{{V/Vj} 7^ 0). Moreover, we only have one realization. The 
test here presented works with a large amount of indepen- 
dent data (see the hypothesis in the deduction of the score 
statistic, section 2). To convert our correlated (and then de- 
pendent) data to a sample of independent data we proceed 
in the following way. Given the real part of the visibilities, 
we perform the transformation in signal-to-noise eigenmodes 
(expression |SJ and define the variable 

y* = J2 L ^ = ^/(E k + i) 1/2 . (6) 

j 

These new data are uncorrelated and normalized (zero 
mean, unit dispersion). We operate in the same way with the 
imaginary parts and add the resulting quantities to those 
obtained with the real parts. Finally, we have the yt data 
(i = 1, . . . , 2 x number of visibilities) which are uncorrelated 
and normalized. Moreover, if the visibility distribution is 
multinormal then the yi data are independent and Gaussian 
distributed with zero mean and unit dispersion. Then, if 
Gaussianity holds, the yi data fulfill the hypothesis of the 
smooth tests of goodness-of-fit and their U 2 statistics must 
have a xi distribution. The test is then applied to the yi 
quantities. 

3.3 Phases 

The analysis can be performed on the visibility phases. It 
is well-known that if the visibility, V(ui), is Gaussianly dis- 
tributed, its phase has a uniform distribution. The phases of 
the visibilities are not independent because there are corre- 
lations of the real parts among themselves, and among the 
imaginary parts (the real and imaginary parts are not corre- 
lated if the primary beam is a Gaussian function). Thus, and 
for the case of a Gaussian primary beam, given two points 
ui and «2 in the UV plane, the joint probability of finding 
the phase value ft in Hi and 02 in 112 is given by 

where: 



E = 


(AC - B 2 )(AC - D 2 ) 


F = 


{2tx)- 2 E- 1/2 


A = 


ay??) = {{vif) 


B = 




C = 


({vif) = {(v 2 ') 2 ) 


D = 


{vlvi) 


ct = 


C( AC - D 2 cos 2 0i - B 2 sin 


P = 


A(AC - D 2 cos 2 2 - B 2 sin 


7 = 


B(AC - D 2 ) cos 0i cos 02 + 
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Figure 1. Joint probability of finding the values of the phases 9i 
and 6>2 in the positions u± and 1x2. 



Figure 2. The VSA extended array since October 2001 in its 
extended configuration. There are fourteen antennas mounted at 
an angle of 35° to a tilting table hinged along its northern edge. 



D{AC - B 
K3 = h arctan 



) sm Hi sin t/2 

7 

(a/3-7 2 )V2 



+ 



( a /3- 7 2 )V 2 7 
a/3 



Thus P(0 1 ,0 2 ) / P(0i) ■ P(<9 2 ) = (1/2tt) 2 « 0.0253. 
As an example, we have calculated the form of the previous 
probability for the particular case A = C = 1 and B = D — 
0.1. The plot is shown in Figure 

The statistics of subsection 2.3 cannot be applied di- 
rectly to the phases because these are not independent. 
We therefore construct a related independent quantities in 
the following way. We decorrelate the different visibilities 
so that, under the assumption of Gaussianity, they will be 
independent. To do this we proceed as in the previous sec- 
tion: we decorrelate the real parts of the visibilities. That 
transformation gives us the quantities yf . We proceed in 
the same way with the imaginary parts and obtain the as- 
sociated quantities y\. In this way, we have the complex 
number yf~ + iy\. These quantities are independent for dif- 
ferent i. We calculate the phase of each one and apply to 
them the statistics explained in subsection 2.3 for a uniform 
variable. 



4 SIMULATED INTERFEROMETER 
OBSERVATIONS 

To illustrate the method described above, in what follows 
we apply it to simulated observations with the Very Small 
Array. The VSA is a 14-element heterodyne interferome- 
ter array, operating at frequencies between 28 and 36 GHz 
with a 1.5 GHz bandwidth and a system temperature of 
approximately 30 K, sited at 2400 m altitude at the Teide 
Observatory in Tenerife (see IWatson et al.l 120031 for a de- 
tailed description). It can be operated in two complementary 
configurations, referred as the compa ct array (which pro bes 
the angular range i w 150-900; see iTavlor et alJl2003l for 
observations in thi s configuration) and the extended array 
(I = 300-1500; see lGrainee et al.ll2003l and lDickinson et alJ 
120041 for observations in this configuration). For the simu- 
lations in this paper, we will use as a template experiment 
the extended array configuration, which corresponds to the 



antenna arrangement shown in Figure [5] The observing fre- 
quency is 33 GHz, and the primary beam at this frequency 
has a FWHM of 2.1° (corresponding to a 1/e diameter of the 
aperture function of Am ~ 24A). For the VSA, the shape of 
the antenna can be approximated by a Gaussian function, so 
we can calculate the correl ation matrix using the expre ssions 
(11) and (12) presented in lHobson fc Maisingeil (120021) . 

We use a template observation corresponding to 
A r days><4 hr integration time, and we explore several values 
of A'days- The corresponding noise level will be simulated by 
adding to each visibility a random Gaussian number with 
an amplitude of ~8 Jy per visibility (one channel, one po- 
larization) in an integration time of 1 min, to reproduce 
the observed sensitivity of the VSA. Thus, in a single day's 
4 hr observation we get an rms of ~54 mjy/beam in the 
real-space map. A simulated 1 day VSA file typically con- 
tains 25000 64 s visibilities. In order to perform the analyses 
here, we procee d to bin these data into cells of a certain size, 
as described in lHobson fc Maisineerl (I2002T) . For the analy- 
ses carried out in this paper, we have used a bin size of 9A, 
where A is the wavelength of the measured signal, which is 
similar to tha t used for the power spectrum evaluation in 
iGrainge et alJ <l2003l) . We have explored different values of 
the bin size, and the number has been chosen according to 
two criteria. If the bin size is too small, the signal-to-noise 
ratio per pixel is very low. On the other hand, for large bin 
sizes, the number of binned visibilities is too small to apply 
the test. The value of 9A provides both a reasonable number 
of visibilities and good power in the detection. Nevertheless, 
we have checked that small variations in this number (using 
values from 9 to 14A) do not produce significant changes. 

Our template observation of the VSA contains a total of 
895 visibility points after binning in 9 A cells. This template 
will be used throughout the paper. 



5 GAUSSIAN SIMULATIONS 

In this section we calibrate the method by using simulated 
Gaussian observations. It can be shown that the correlation 
matrix of the real parts of the visibilities can be written 
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Table 1. Values of the mean ({U 2 )) and the standard deviation 
(<t) of the statistics U? for 10000 Gaussian CMB plus noise sim- 
ulations of our template experiment. They are compared with 
the corresponding asymptotic values (for Xi)i displayed in last 
column. 





u 2 


U 2 


U 2 


ul 


xl 


V?) 


0.9863 
1.4077 


0.9983 
1.4426 


1.0370 
1.4848 


1.0131 
1.5265 


1.0000 
1.4142 



as Cf = Lf (Lf )* with L%ij = <Ti(Ef) 1/2 R%, where E? are 
the signal-to-noise eigenvalues and R R is the rotation matrix 
defined in subsection 13.11 To generate Gaussian simulations 
with the desired correlation we start with a set yi ~ N(0, 1) 
(i — 1, . . . , number of visibilities). The real parts of the visi- 
bilities are given by Vj = . L^yj. In an analogous way 

we construct V/ . After that, a Gaussian realization of the 
noise is added to each visibility. 

Given the simulated visibilities we decorrelate their real 
and imaginary parts as explained in subsection 13.11 After 
that, we calculate the distributions of the Uf quantities (sub- 
section lT^l . As we have Gaussian simulations, if the amount 
of data is relatively large, these distributions must be very 
close to xi functions as it is explained above. The form of 
the distributions is shown in Fig. |3] and they are compared 
with Xi functions normalized to the number of simulations 
used (dashed line). We have used 10000 simulations of a 
VSA field, with a noise level corresponding to 25 x 4 hr for 
each one. This is the ty pical noise level achie ved in a single- 
field VSA observation iGrainge et al.ll200&fl . We bin these 
data using a cell size of 9A, to obtain 895 binned visibilities. 
In Table □ the mean value and the standard deviation (a) 
of the distributions is compared with the same quantities of 
the Xi distribution. 

As mentioned above, the mean value of the Uf must 
be equal to 1 independently of the number of data. From 
Table Q we see that the convergence to this value is very 
accurate. The value of the standard deviation must be y/2 
only when the number of data tends to infinity (asymptotic 
case). In the case we are simulating (895 visibilities) we see 
that the values of the standard deviation are very close to 
the Xi values (only \j\ seems to separate slightly from the 
asymptotic value). The VSA fields we are analysing have a 
number of visibilities between ~1000 and ~4000 and then 
the approximation to the x\ functions will be better than 
the case shown in Table Because of that, we can approxi- 
mate the Uf to xi distributions in a very accurate way. We 
then suppose the distributions to be xi f° r t ne Uf statis- 
tics or xi f° r f ne &k- We have applied to these simulations 
the method described for the phases and we also find xi 
distributions for the Iff statistics. The advantage of this as- 
sumption is that we do not have to make simulations to 
calculate the statistical distributions for every VSA field for 
the Gaussian or uniform null hypothesis. We have checked 
that the distribution of the U 2 statistics is also very close 
to xl functions even when the number of data is ~200. In 
the following sections we cut data with low signal-to-noise 
eigenvalues and the statistics are calculated with ~200 data; 
also in this case, the distributions of the statistics will then 
be approximated by x 2 functions. 



6 NON-GAUSSIAN SIMULATIONS 

In this section, given our template of a VSA observation, 
we generate non- Gaussian simulations of the visibilities and 
add the (Gaussian) VSA noise to them. We analyse three 
kinds of non-Gaussian simulations. First, we consider simu- 
lations created by means of the Edgeworth expansion in the 
visibility space; in second place, we analyse a cosmic strings 
simulation; finally, x 2 simulations are considered as an ex- 
ercise to estimate the power of the test to constrain the /jvl 
parameter. 

In order to characterize the power of the method in 
each of these cases, we use the "power of the test" . Roughly 
speaking, the power, p, of a given test at a certain signif- 
icance level a (0 < a < 1) is parameterized as the area 
of the alternative distribution function outside the region 
which contains the 1 — a area of distribution function of 
the null hypothesis (the Gaussian case). Thus, a large value 
of p for a prefixed small value of a indicates that the two 
distributions ha ve a small overlap, so we c an distinguish be- 
tween them fe.g. lBarreiro fc Hobsonl200ll) . For definiteness, 
in what follows we adopt for the significance levels the values 
a = 0.05/0.01 (i.e. significance levels of 5% and 1% respec- 
tively) , so we shall quote these pair of values for each one of 
the cases considered. 



6.1 Edgeworth Expansion 

We first construct simulations with a certain degree 
of non-Gaussianity by using the Edgeworth expansion 
l|Martfnez-Gonzalez et all2002f) . We then analyse these sim- 
ulations and calculate the power of the test to discriminate 
between a Gaussian distribution (the null hypothesis) and 
a distribution with skewness and kurtosis injected via the 
Edgeworth expansion (the alternative hypothesis). The aim 
is to quantify which signal-to-noise level is required to de- 
tect a certain degree of non-Gaussianity in the data with 
our method. 

We have two options in preparing these simulations: we 
can inject the skewness and kurtosis either in real space or in 
visibility space. If we include the non-Gaussianity through 
the Edgeworth expansion in real space, this non-Gaussianity 
is diluted when we transform to the Fourier (or visibility) 
space (this is illustrated in Appendix A). Thus, we decide to 
inject the skewness/kurtosis directly in Fourier space, and 
calibrate our method in terms of the ability to detect de- 
viations of the moments of the signal-to-noise eigenmodes 
of the visibilities with respect to the Gaussian case. A sim- 
ilar approach has been adopted by other authors (e.g. in 
iRocha et ail 1200 J they explore the non-Gaussianity of the 
visibilities simultaneously to the estimation of the power 
spectrum, by using a modified non-Gaussian likelihood func- 
tion). 

To use non-Gaussian simulations with given skew- 
ness/kurtosis in Fourier space generated from the Edgeworth 
expansion is not motivated by any physical model, but it 
is a reasonable approach to calibrate the power of a non- 
Gaussianity test because these non-Gaussian maps are easy 
to prepare with specified statistical properties. Moreover, 
systematic effects could introduce a non-Gaussian signal in 
the visibilities which could be mimic with the Edgeworth 
expansion. The probability distribution of these systematic 
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Figure 3. Distributions of the Uf statistics, from left to right, top to bottom i = 1,2,3,4. They are obtained from 10000 simulated 
observations of a single-field observation with the VSA, consisting of a Gaussian CMB signal plus a noise level corresponding to 25 X 4 hr 
integration. The visibilities are binned in cells of 9A, and the number of visibilities in each simulation is equal to 895. The dashed line is 



a x\ function normalized to the total number of simulations. 



effects could be, for example, asymmetric and, in a first ap- 
proximation, this fact can be modeled by a distribution with 
a non-zero skewness. In addition, the distribution could be 
more peaked or flat that the Gaussian one, and that can be 
modeled by a distribution with a non-zero kurtosis. In this 
manner the Edgeworth simulations are taken as a bench- 
mark to quantify the power of our method to detect some 
kind of non-Gaussian signals. 

These non- Gaussian simulations are generated as fol- 
lows. We generate a realization of independent values yj 
with zero mean, unit variance, and skewness and kurtosis 
S and K, respectively. For definiteness, we adopt here the 
values S = K — 1. As in subsection [S] the real parts of the 
visibilities are = Y2j L^ijVj- The imaginary parts are 
generated in an analogous way. After that, we add different 
noise levels according to a VSA observation. We then decor- 
relate the data of the simulations and calculate the power 
of the tests. 

Again, we use the same VSA template observation, 
binned into cells of side 9A to study the power of our tests. 
Figure 2] shows the logarithm of the eigenvalues associated 
with the real visibilities (those resulting from the diagonal- 
ization of C^ij/(<ncrj)) and those associated with the imag- 
inary parts (the diagonalization of Cl^ / (oiOj)). Only the 
23% of the data have an eigenvalue signal-to-noise higher 
than 0.01; that is, a signal-to-noise ratio higher than 0.1. 
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Figure 4. Eigenvalues E^ obtained from the template observa- 
tion used in Figure [3] We plot the values associated with both 
the real and imaginary parts of the visibilities (see the text for 
details). We have 895 X 2 eigenvalues, only 420 of which have 
a value above 0.01, which corresponds to a signal-to-noise of 
(0.01) 1 / 2 = 0.1. 



First, we consider the case of a low signal-to-noise ratio, 
and we adopt the noise level s corresponding to a single-field 
VSA observation (25 x 4 hr. lGrainge et alJl2003Tl . We anal- 
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yse the non- Gaussian simulations with different cuts in the 
eigenvalues; that is, we include in the analysis only those 
eigenmodes whose eigenvalue is higher than a value E cut . 
In Table [5] is shown the power of the E/f, E/f, E/f and E/f 
statistics to discriminate between a Gaussian and a non- 
Gaussian distribution. It can be demonstrated that the sim- 
ulations are constructed such that t/; has zero mean and unit 
variance, so only the E/f and E/f distributions are notably 
different from a xi distribution (see section 12:21 . In fact, 
E/f also deviates slightly from a \\ distribution when the 
analysed data have some degree of kurtosis (E/f is related 
to fi2, i-e. to the fourth power of the data), but, as we can 
see in Table [5] this deviation is very low. The corresponding 
probability function is shown in Fig. (dotted line) and is 
compared with a Gaussian function (dashed line). We have 
used 5000 simulations in this computation. Note that al- 
though the input skewness and kurtosis is 1.0, the output 
simulations have a mean skewness equal to 0.96 ±0.07 and a 
mean kurtosis equal to 0.85±0.32; i.e. we are analysing some 
simulations with kurtosis ~ 0.5. So we expect less power in 
the detection of the kurtosis that in the detection of the 
skewness. The first column in Table [5] indicates the min- 
imum value of the signal-to-noise eigenvalues that we use 
to perform the analysis. We indicate the square root of the 
latter quantity (signal-to-noise ratio) in parentheses. The 
second column is the number of eigenmodes that we use to 
calculate the statistics; that is, the number of eigenvalues 
such that Et > E cu . Finally, the last columns indicate the 
power of the different statistics (in percentages) when the 
significance level is 5%/l%. We see that our test is not able 
to detect non-Gaussianity when every eigenmode is included 
in the analysis. When eigenmodes with low signal-to-noise 
ratio are excluded the power is improved but the values are 
very poor. This tells us that the noise level of a single VSA 
pointing (25 x 4 hr) is too high to detect the non-Gaussian 
simulations we have constructed via the Edgeworth expan- 
sion. 

We therefore now explore the case of an integration 
time of 25 x 25 x 4 hr, which is comparable to the whole 
signa l-to-noise level a chieved in the dataset presented in 
iDickinson et alJ i2004f) . In this case, the noise levels are re- 
duced by a factor 5 with respect to the previous case. The 
results are shown in Table [3] As is to be expected, the de- 
tection of non-Gaussianity is better. For example, we could 
detect our non-Gaussian simulations with a power equal to 
~99% (for the E/f statistic) with the signal-to-noise achieved 
during 2500 hours and analysing eigenmodes with eigenval- 
ues higher than 0.4 or 0.5. In this last case the distribution 
of all the data (every eigenmode) for one simulation is shown 
by the histogram in Figure^] We see that this distribution 
is closer to a Gaussian (dashed line) than the initial one 
(dotted line) for two reasons: first, the loss of skewness and 
kurtosis by the construction of the Edgeworth simulations 
(the actual skewness and kurtosis of the simulations is lower 
than the input values, see above) and, second, by the addi- 
tion of Gaussian noise. When we only plot the autovalues 
higher than 0.4 or 0.5, the histogram is closer to the dotted 
line; that is, we see the non-Gaussianity. 

In the tables shown in this section we have presented 
only the power of the E/f statistics. In the case we are study- 
ing, 53 has less power than E/f because S3 = U( + E/f + E/f 
also has information about E/f and E/f, which do not have 
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Figure 5. Dotted line: distribution function created by the Edge- 
worth expansion with S = 1.0 and K = 1.0. Dashed line: Gaus- 
sian distribution. Histogram: distribution obtained from a sin- 
gle simulation (signal plus noise) involving all the eigenvalues, 
for a noise level corresponding to a 25 X 25 X 4 hr integration. 
The three distributions have zero mean and unit variance. The 
non-Gaussianity in the single simulation is hardly seen in its his- 
togram; however, it is easily detected when we used our method, 
as shown in Table 131 



power and compensate for the power of E/f. However, S3 
is better than the E/f statistic if we also have detection of 
non-Gaussianity in E/f and/or E/f. For the same reason, S4 
is slightly better than E/f because it has combined informa- 
tion about E/f and E/f . In our case, however, 5*4 is not better 
than E/f , so we show only the E/f statistics. 

Finally, we have analysed the Edgeworth simulations 
with the test applied to the phases (subsection 13.31 . We 
have found that the non-Gaussianity of this kind of simu- 
lation is not detected with the phases method; that is, the 
phases are compatible with a uniform distribution. A deeper 
analysis of the phases shows that the moments fi a (section 
12 .31 are sligthly different from the values corresponding to a 
uniform distribution. However, the quantity, n of data with 
high signal-to-noise eigenvalues is not large enough to give 
values of the E/f statistics that differ sufficiently for the val- 
ues obtained from a uniform distribution, so the power of 
the test has very low values. (The distribution for the phases 
can be calculated analytically by assuming, for example, a 
distribution for the real and imaginary part s given by equa- 
tion (25) in iMartmez- Gonzalez et alJl2002l . We have found 
this ditribution to oscillate about the uniform one, but the 
moments are very close to those of the uniform distribution.) 



6.2 Detectability of Cosmic Strings 

In this section, we probe if our method is able to detect 
the non-Gaussian signature introduced by cosmic strings. 
To this end, we apply our meth od to the analysis o f the 
string simulation shown in Fig. IBI feouchet et al.lll98Sl) . We 
first analyse the simulation without noise. In this way we can 
learn what happens when we have only the string map. After 
that we add different noise levels to see how the method is 
able to detect the strings. 

Although in theory the correlation matrix depends only 
of the power spectrum and the beam (see expression the 
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Table 2. Power of U^, J7|, U 2 and U? to discriminate between a Gaussian 
and a non-Gaussian distribution obtained from the Edgeworth expansion with 
input skewness and kurtosis equal to 1.0 (see the text for a discussion about 
the actual skewness and kurtosis of the simulations). The template observa- 
tion corresponds to an integration time of 25 X 4 hr. The first column shows 
the value used for E cut , and the second column displays the number of re- 
maining visibilities above the previous threshold. For each statistic is shown 
the power (as a percentage) when the significance level is 5%/l%. 



E cut (VE cut ) 


Num. 


u 2 


U 2 

2 




U 2 


0.0 (0.0) 


1790 


5.10/1.12 


4.92/1.06 


9.06/2.46 


6.44/ 1.90 


0.1 (0.32) 


188 


5.12/0.82 


5.96/1.54 


26.20/13.26 


12.30/ 7.48 


0.2 (0.45) 


133 


5.08/0.92 


6.68/1.70 


31.26/17.68 


13.76/ 8.76 


0.3 (0.55) 


107 


4.84/1.04 


6.38/1.74 


32.06/18.46 


13.70/ 9.70 


0.4 (0.63) 


89 


5.06/0.94 


6.56/2.14 


33.64/20.22 


15.02/10.90 


0.5 (0.71) 


76 


5.82/1.10 


7.14/2.30 


34.22/20.56 


14.82/10.94 



Table 3. Power of U 2 , (7|, C/g and C/| to discriminate between a Gaussian 
and a non-Gaussian distribution obtained from the Edgeworth expansion with 
input skewness and kurtosis equal to 1.0 (see the text for a discussion of the 
actual skewness and kurtosis of the simulations). The template observation 
corresponds to an integration time of 25 X 25 X 4 hr. The columns have the 
same meaning as in Table l2j 



E cut (VE cut ) 


Num. 


U 2 


U 2 


U 2 


U 2 


0.0 (0.0) 


1790 


5.38/0.86 


5.52/1.22 


57.96/36.08 


18.20/10.04 


0.1 (0.32) 


513 


5.62/1.22 


6.48/1.62 


95.92/88.62 


35.58/25.64 


0.2 (0.45) 


441 


5.94/1.52 


7.36/1.82 


97.66/91.94 


36.24/26.82 


0.3 (0.55) 


402 


5.60/1.08 


7.60/2.22 


98.26/93.28 


37.42/28.52 


0.4 (0.63) 


371 


5.90/1.54 


7.52/2.02 


98.72/94.28 


38.56/29.68 


0.5 (0.71) 


245 


5.52/1.40 


7.80/2.38 


98.82/94.44 


39.24/30.80 



(finite) pixel size of the real string map we are analysing 
and the irregular coverage of the visibility plane could in- 
troduce noticeable imprecisions. The correlation matrix of 
the strings is then calculated using Gaussian maps simulated 
with the same pixel size: we calculate the power spectrum of 
the string real map and construct Gaussian real maps with 
this power spectrum (we use 40000 Gaussian simulations). 
From these Gaussian maps we calculate the corresponding 
visibility values (expression[3J. The correlation of these visi- 
bility simulations is that used for the analysis of the strings. 
Note that in the expression @ there is no hypothesis con- 
cerning the statistic of the temperature field. The only hy- 
pothesis is the homogeneity of the field. 

The template observation of the VSA is the same as in 
previous sections (895 binned visibilities in 9 A cells). 



6.2.1 Noiseless Map of Strings 

In this case we diagonalize the correlation matrix of the sig- 
nal because we do not have noise. With this diagonalization 
we decorrelate the data in a similar way as is done in sections 
03 and OH 

We analyse the real and imaginary parts of the visibili- 
ties obtained from the string map without noise. The values 
found for the statistics are: Ul ~ 1.4 ■ 10 3 , U% ~ 3.4 ■ 10 8 , 
Ui ~ 2.9- 10 13 and £/f ~ 4.4- 10 18 . The estimated moments, 
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Figure 6. String map analysed in this paper. 
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Figure 7. Values of yi associated with the real part of the vis- 
ibilities for an observation of the string map presented in Fig. [fj] 
using our VSA template without adding noise. 



Figure 8. Same as in Fig. [T] but for a simulated Gaussian CMB 
map. 



fa, are fa ~ -0.89, fa ^ 6.2 ■ 10 2 , fa ~ -3.1 ■ 10 5 and 
fa — 2.5 • 10 s . These values are clearly incompatible with 
a Gaussian realization. The yi quantities (see expression |SJ 
associated with the real parts of the string visibilities are 
shown in the Fig. Q (those associated with the imaginary 
parts have similar features). Every yi has an associated sig- 
nal eigenvalue Ei whose value decreases with i. We see, then, 
that for the strings the absolute value (or amplitude) of yi 
grows when Ei decreases (this explains why the moments 
fa. are so large). This feature shows that the yi are not well 
decorrelated or normalized. 

We analyse a Gaussian map with the same procedure. 
Figure |H| shows the resulting yi quantities for this Gaussian 
case. We see that they have a width of order of unity. Thus, 
we conclude that the behaviour of Fig. Q is a feature of the 
non-Gaussianity of the string signal. The fact that the yi 
quantities are not well decorrelated or normalized indicates 
that we are not able to estimate their correlation matrix ac- 
curately. This seems to be because we have a model with a 
very large cosmic variance, and the correlation matrix (or 
equivalently the power spectrum) cannot be well estimated 
from only one realization. Moreover, we are estimating the 
power spectrum with the expression that maximizes the like- 
lihood und er the hypothesis of Gaussianity for the temper- 
ature field feond et al.ll200oTl . But the temperature field is 
not Gaussian for the strings, and so the estimated power 
spectrum for the strings (that which maximizes the likeli- 
hood in this case) can differ from the one we are using. For 
a study of the dependence of th e power spectrum on non- 
Gaussianity see I Amendolal i2002h . 

In the Gaussian case, however, we are able to calculate 
the power spectrum and then we decorrelate and normalize 
the visibilities properly. In this way, the yi quantities have 
a width equal to unity, i.e. (j/ 2 ) = 1, as shown in Figure |H] 
Summarizing, when there is no noise, in principle we should 
detect the strings by means of all the Vf statistics (even (7 2 
and Ui) because their values are not compatible with a \i 
distribution (see Table 0. 

Finally, we have applied the smooth tests of goodness- 
of-fit developed by Neyman to the phases of the visibilities 
of the noiseless map of strings (section f3. 31 . The Uf values 
are: U\ = 3.64 (94.36%), C/f = 4.59 (96.78%), Uf = 0.32 



Table 4. Power of the U? (i = 1,2,3,4) statistics for simulated 
observations of the strings map using a noise level corresponding 
to an integration time of 25 X 4 hr. The columns indicate the 
power in percentage when the significance level is 5%/l%. 



^cut 


ul 


U 2 


r/ 2 




0. 


5.16/0.82 


5.72/1.20 


4.92/1.00 


4.50/0.98 


0.1 


3.72/0.58 


6.00/0.86 


2.28/0.36 


1.70/0.28 


0.2 


3.10/0.30 


4.16/0.44 


1.70/0.24 


1.20/0.34 


0.3 


5.10/0.66 


3.36/0.22 


1.48/0.26 


1.24/0.30 


0.4 


1.46/0.14 


1.78/0.12 


1.08/0.36 


1.28/0.30 


0.5 


1.00/0.10 


2.04/0.08 


0.80/0.18 


0.90/0.22 



(42.84%) and Uj = 0.13 (28.16%). And for the Si statis- 
tics: Si = 3.64 (94.36%), S 2 = 8.23 (98.37%), S 3 = 8.55 
(96.41%) and S 4 = 8.68 (93.04%). The probability of obtain- 
ing a lower or equal statistic value under the null hypothesis 
(the visibilities are Gaussian) is showed in parentheses. 

In the following subsection we discuss the effect of Gaus- 
sian noise on the detectability of strings with our method. 

6.2.2 String map with different noise levels 

The features in Fig. Q change when (Gaussian) noise is 
added. The behaviour for low values of i remains unchanged 
because these values are associated with high signal-to-noise 
values so that these data are dominated by the signal. How- 
ever, the behaviour for high i is dominated by the noise 
(low signal-to-noise eigenvalues) so that the behaviour will 
resemble that of a Gaussian signal. 

The analysis with different noise levels is given in Ta- 
ble 2] (noise corresponding to an integration time equal to 
25 x 4 hr) and Table |5] (noise corresponding to an integra- 
tion time equal to 25 x 25 x 4 hr). We have analysed 5000 
simulations involving the string simulation plus a noise real- 
ization. The analysis is done for different cuts of the eigen- 
values (E cut ). The power to distinguish between a Gaussian 
distribution and the strings plus noise when the significance 
level is 5%/l% is shown in the tables. 

The results in Table [I] indicate that the VSA noise of a 
single field observed during 25 x 4 hrs is too high for detect- 
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Table 5. Power of the U? (i = 1,2,3,4) statistics for simulated 
observations of the string map using a noise level corresponding 
to an integration time of 25 X 25 X 4 hr. The columns indicate the 
power in percentage when the significance level is 5%/l%. 



^cut 




U 2 


U 2 




U 2 


0. 


5.30/0.98 


24.26/8.44 


4.20/0 


86 


4.28/1.04 


0.1 


6.56/0.84 


79.56/49.32 


1.58/0 


12 


1.30/0.22 


0.2 


7.18/0.92 


86.72/59.60 


1.08/0 


08 


1.24/0.34 


0.3 


2.94/0.26 


87.62/60.44 


0.80/0 


04 


0.70/0.10 


0.1 


3.74/0.10 


90.74/64.16 


0.48/0 


02 


0.66/0.08 


0.5 


3.20/0.12 


92.90/69.58 


0.48/0 


08 


0.38/0.08 



ing the strings with our method. When the integration time 
is multiplied by 25 (Table|5J we start to detect the strings by 
means of the U£ statistic. It is important to note that, even 
if the data were well decorrelated, the [/f statistic would 
also be a indicator of non-Gaussianity. For example, given 
realizations of normalized and independent data with kur- 
tosis K 0, the mean value of £/f is equal to (K + 2)/2 7^ 1. 
Another examp le of the use of U£ to detect non-Gaussianity 
can be found in lAliaea et ail j2003bl) in the formalism of the 
multinormal analysis. 

As a test, we repeat the same procedure using a Gaus- 
sian simulation instead of the string map and, as expected, 
we obtain a negligible power. 

6.3 \ 2 simulations 

Nowadays, non-Gaussian simulations constructed by the ad- 
dition of a Gaussian field and its square have acquired a 
notable relevance because they could represent perturba- 
tions produced in sever al inflationary scena rios (for a re- 
view on the subject, see iBartolo et, alJl2004l and references 
therein). The parameter which measures the coupling with 
the non-linear part is denoted by /nl and, then, an analysis 
to study the power of our method to detect this parameter 
seems to be very appropriate. Realistic simulations in the 
flat-sky approximatio n would require to develop appropri- 
ate software (see e.g. iLieuori et al.1120031 for simulations on 
the full sphere), and this is beyond the scope of the present 
work. However, we can estimate the power of our method in 
detecting a non-zer o fNL component usin g a simple approx- 
imation. We follow lAghanim et alJ i2003fi . and we generate 
what they call "x 2 maps" by assuming that the observed 
temperature contrast field on sky (A(cc) = AT(x) /T) can 
be written as 

A(x) = A L (x) + f NL (A 2 L {x) - (Al(x)» 

where Al is the Gaussian (linear) component. In reality, 
this app roximation is only valid in the Sachs- Wolfe regime 
(see e.g. iKomatsu fc SDergelll200lTl . but we use it as a toy 
model in our exercise of the study of the detection of the 
/nl parameter 1 . 

1 Note that in the Sachs- Wolfe regime, A(x) = <3>(a;)/3, so the 
definition of the non-linear coupling parameter in our \ 2 model 
gives 4 (as) = <5> L (x) + (/ NL /3 )(<S> 2 (x) - (<5> 2 (x))) There fore, 
the coupling parameter used bv lKomatsu &c Spergell J200ll) and 




Figure 9. Distributions of [/J for the Gaussian case (dashed 
boxes) and for the non-Gaussian case with /jvz, = 10000 (solid 
boxes). The solid line is a \ 2 function normalized to the number 
of simulations (10000 simulations). (For the sake of clarity, the 
histogram for the Gaussian case has been cut at the top.) 



We first consider the ideal case where there is no in- 
strumental noise, and we have analyzed different sets of 
simulations with /jvl values (Gaussian case), 100, 1000 
and 10000. We have run 10000 simulations for every case. 
Only the case with /jvl = 10000 has been detected via 
the U£ statistic. Fig. |U] shows the distribution function of 
the J/J statistic for the Gaussian case (dashed boxes), com- 
pared with the corresponding one for the non-Gaussian case 
with /jvl = 10000 (solid boxes). From here, we infer that 
the power of this statistic to discriminate between the non- 
Gaussian simulations and the Gaussian ones is about the 
79% (72%) with a significance level of 5% (1%). Analysing 
the same previous simulations but adding the noise level 
corresponding to an observation time of 25 x 25 x 4 hr, we 
find that the power of our statistic is reduced roughly by a 
fact or 2. 

ISmith et alJ i2004T) constrained the /nl parameter by 
using the bispectrum of the VSA extended array data (note 
that this is exactly the same configuration that we have 
adopted to build our visibility template). With their defini- 
tion of fNL, they found an upper limit of 7000 with a con- 
fidence level of 95%. In the Sachs- Wolfe regime, this would 
correspond to a value of 31500 according to our }nl def- 
inition. However, this value can not be direclty compared 
with the one obtained in our work, because our model does 
not correspond to the realistic case which was considered in 
ISmith et alJ <2004) . 



7 DISCUSSION AND CONCLUSIONS 

In this paper we have presented a method of searching for 
non-Gaussianity in data from interferometric CMB observa- 
tions, directly in the visibility space. This method can be 
adapted to other interferometric experiments (e.g. CBI and 
DASI) if we know the correlation matrices of the signal and 
of instrumental noise. Note that in this paper, we have dealt 

ICavon et ail i2003al) is related wi th our definition as /jvl/3, and 
the one used bv lSmith et alJ i2004l) is 2f NL /9. 
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with decorrelated noise (which is usually the case for these 
experiments) , but the case of correlated noise can be studied 
in an analogous way. 

We have applied the method to work with the real and 
imaginary parts of the visibilities. The method tests whether 
they are Gaussian distributed. However, we have applied the 
method to the phases of the visibilities to test whether they 
are uniformly distributed, but we found that it is not very 
sensitive for detecting the kind of non-Gaussianity we have 
analysed here. 

We have integrated the signal-to-noise formulation into 
the smooth goodness-of-fit tests. In this way we can deal 
only with the data dominated by the signal we want to 
analyse. In the text it is noted that the correlation ma- 
trices of the signal-to-noise eigenmodes and of the signal 
are descomposed as the product of a matrix and its trans- 
pose (subsection 13.11 and section |KJ . This decomposition is 
analogous to that of Cholesky. This latter decomposition is 
computationally faster than the one used here; however, the 
decomposition we use allows us to deal with better quality 
data, that is, with a higher signal-to-noise ratio. The analy- 
sis with the Cholesky decomposition takes every eigenvalue; 
that is, E cut = 0. 

It is important to stress that smooth goodness-of-fit 
tests do not require the data to be on a regular grid, but 
can be applied to any data set. In that sense the test is 
perfectly adapted to interferometric data because the cov- 
erage of the UV plane is neither regular nor complete. The 
method could therefore also be applied by selecting those 
visibilities in a certain UV range, thus allowing study of 
the non-Gaussianity as a function of angular scale. In ad- 
dition, although the method has been presented here as a 
tool to study the non-Gaussian properties of the sky signal, 
it could also be used as a powerful diagnostic for detecting 
systematics in the data, as we pointed out in Section 6.1. 
For short integration periods, a stack of visibilities will be 
dominated by instrumental noise, so this method could be 
used to trace the presence of spurious signals in the data 
(e.g. those comming from cross-talk between antennas), or 
to study the correlation properties of the noise. 

Summarizing, to study the power of our method in 
detecting non-Gaussian signals on the sky, we have anal- 
ysed three kinds of simulations. First we have analysed non- 
Gaussian visibilities created by inserting some degree of 
skewness and kurtosis (S = K = 1) with the Edgeworth 
expansion directly in Fourier space. Using the VSA as a ref- 
erence experiment, we have shown the performance of the 
method in detecting those levels of non-Gaussianity in the 
data with realistic values of the integration time for this ex- 
periment. All these results can be easily adapted to other in- 
struments, just by rescaling the integration times according 
to the square of the ratio of the different sensitivities. In ad- 
dition, we have also shown the performance of the test in the 
detection of the non-Gaussian signal introduced by cosmic 
strings. Even though those kind of signals a re usually de- 
tected using real-space statisti cs or wavelets llHobson et alJ 
ll999l:lBarreiro fc Hobsonll200 ll) . we have demonstrated that 
the signal-to-noise eigenmodes approach allows us to detect 
them because the method is very sensitive to the charac- 
terization of the covariance matrix. In this particular case, 
due to the non-Gaussian nature of the strings, a complete 
decorrelation can not be achieved and the statistics show 



huge deviations with respect to the Gaussian case. For the 
case of the VSA, this translates into the fact that cosmic 
strings can hardly be detected in single-field observations 
(integration times of 25 x 4 hr), but they could be detected 
(if present) using this method, given the current sensitivity 
achieved by the whole data set published by the VSA team 
(with a sensitivity of ~25 x 25 x 4 hr integration). Finally, we 
have studied the power of the method to detect a non-zero 
/jvi component in a toy model based on x 2 maps. 
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Note that the Gaussianity associated to the third moment 
in the Fourier space is reduced by a factor Aside compared 
to that in real space. 



APPENDIX B: SKEWNESS AND KURTOSIS 
OF A GIVEN EIGENMODE FROM THE 
EDGEWORTH EXPANSION 

When we introduce some degree of non-Gaussianity (skew- 
ness or kurtosis) by means of the Edgeworth expansion in 
the visibilities, it is clear that each eigenmode will show a 
different degree of non-Gaussianity which will depend on 
the associated signal-to-noise eigenvalue. This relation can 
be obtained as follows. 

As it is explained in the main text, we generate a real- 
ization of independent values yj with zero mean, unit vari- 
ance, and skewness and kurtosis 5* and K, respectively. 
As in subsection |5] the real parts of the visibilities are 
V/ 1 — . L^jUj. The imaginary parts are generated in an 
analogous way. After that, we add the corresponding differ- 
ent noise levels and decorrelate the data by using the trans- 
formation given by equation JSJ. Then, bearing in mind the 
definition of the Ek quantities, it can be easily demonstrated 
that for the final decorrelated and normalized data we have 

As one would expect, the skewness and the kurtosis are 
lost if the (Gaussian) noise dominates the data (Ek <C 1). 
In contrast, the skewness and the kurtosis are preserved for 
the case Ek 3> 1. In the previous expressions, the angled 
brackets ((•}) denote an average over realizations. 



APPENDIX A: BISPECTRUM OF THE 
EDGEWORTH EXPANSION 

Suppose a square real map of pixel coordinates x and pixel 
data y x . These values are generated via the Edgeworth ex- 
pansion such that they are independent, normalized and 
have skewness S. The map has Aside pixels on a side and 
the area of the pixel is A 2 . We now perform its Fourier 
transform: 

a k — A y* e 

X 

whose bispectrum is 

/ ~ ~ ~ \ a 6 X ^ / \ ik-i x-i ikoxo ik^x-i 

{a kl ak 2 a k3 ) = A ^ \V^V^iV^) e e e 

X1X2X3 

Taking into account that y x are independent and (y%) = 
S we have 

{a,k 1 a,k 2 a,k 3 ) = A s 2 idc A 6 S 8k 1 + k2 +k 3 ,o 

where <5 fel fc, = AT 2 V e^i-M* is the Kronecker delta. 

K l i K 2 side L 'x 

The Fourier modes could be normalized to the desired spec- 
trum: a k = Cy 2 a fe /(Asid c A 2 ) such that (a k al) = Ck&k 1 ,k 2 - 
Finally, the (normalized) bispectrum is given by 

> S X 

(|a fel | 2 ) 1 / 2 (|a fe2 | 2 ) 1 /2(|a fc3 P)i/2 jV sidc " fc i+^+ fe ^°- 



